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Structural information extracted from the diffraction of XFEL fs-pulses in a crystal 


A. Leonov^, D. Ksenzov^, A. Benediktovitch^, 1. Feranchuk^ and U. Pietsch^ 

^Department of Theoretieal Physies, Belarusian State University, 220030 Nezavisimosti ave. f, Minsk, Belarus 
^ Festkorperphysik, Universitdt Siegen, 57072 Walter-Flex-Str. 3, Siegen, Germany 

We present a theoretical justification for a method of extracting of supplementary information for 
the phase retrieval procedure taken from diffraction of fs-pulses from X-ray Free Electron Laser 
facilities. The approach is based on numerical simulation of the dynamics of the electron density 
in the crystal composed of different atoms in the unit cell, namely a bi-atomic crystal containing 
heavy and light atoms. It is shown that evaluation of diffraction intensities measured by means of 
different values of XFEL pulse parameters enables to find absolute values of structure factors for 
both types of atoms and their relative phase. The accuracy of structural information is discussed in 
terms of fluctuations of the evaluated atomic scattering factors. Our approach could be important 
for improvement of phase retrieval methods with respect to a more efficient determination of atomic 
positions within the unit cell of macromolecules. 
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I. INTRODUCTION 


The solution of phase problem is one of the major tasks in X-ray structure analysis in order to determine the 
electron density distribution within the crystalline unit cell [T]. Numerous methods of direct and indirect solution 
of the phase problem exist but most of them are based on an initial guess of at least few of scattering phases and 
possible atomic positions of the studied unit cell [2]. The phase problem becomes especially complicated in case of 
crystals with large number of atoms in the unit cell such as in macromolecules and proteins [3]. Moreover, some of 
such crystals are very difficult to produce and they are available only in small sizes 10^ unit cells for the whole 
sample in 3D) that makes their investigation by means of conventional X-ray sources rather intricate. 

X-ray Free Electron Lasers (XFEL) are often denoted as 4th generation X-ray sources providing extremely bright 
and ultrashort X-ray pulses that give unique possibilities to study the structure of matter with angstrom resolution 
on a time scale of femtoseconds SIS]- These unique properties of the XFEL pulses led to the development of some 
qualitatively new methods, where the serial femtosecond crystallography (SFX) is the most prominent one. A set 
of recent experiments [6] proved SFX to be a perfect tool for investigation of small crystals (nanocrystals) assuming 
that structure data can be taken before sample destruction takes place on a time scale larger than the FEL pulse 
length. Refinement of the diffraction data and solving the phase problem in most of these experiments was performed 
on the basis of the method of molecular replacement that still needs the implementation of already known very 
similar structure entities. However, the problem of direct measurement of structural amplitudes is one of the most 
challenging issues of using an XFEL for X-ray structural analysis [T] . Some theoretical investigations in this direction 
considering the XFEL-specific properties have already been published by Son et al. (the high-intense version of the 
MAD approach) [7] and Spence et al. (approach based on analysis of scattering intensities between Bragg reflections 
from nanocrystals) [8] . In the present paper we show that diffraction of XFEL femtosecond pulses from crystals could 
also provide extra information which can be used as initial constraints in the phase retrieval procedures which are not 
available in the conventional X-ray structure analysis. 

The present paper is organized as follows. Secjn|describes the theoretical background for the procedure of extracting 
of some additional information about the crystal structure by analyzing the variations of Patterson maps caused by 
the evolution of electron density of heavy atoms of the studied sample irradiated by the XFEL fs-pulse. In Sec |III| we 
discuss the role of fluctuations that may appear due to the statistical dispersion of evolution paths and estimate the 


conditions under which the fluctuations can be neglected with sufficient degree of accuracy. In Sec|IV| we show the 
numerical results for the evolution of the electron density and separation of various forms of the Patterson maps for 
the arbitrary model crystal as a sample. 


II. THEORETICAL APPROACH 

In some extent our approach is analogous to the method of heavy atom in the X-ray diffraction analysis of proteins 
[9]. However, we do not assume that an additional heavy atom with known scattering factor is implanted artificially 
into the molecule under investigation. In order to explain the physical idea of our method let us consider the unit 
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cell of the crystal that includes natural groups of relatively heavy atoms (for instance, sulfur S or iron Fe) and light 
atoms (for instance, oxygen O, nitrogen N, carbon C or hydrogen H). For matter of simplicity let us consider a unit 
cell with only two subsystems that consist of atoms of the same species: light atoms (L) and heavy atoms (H). In 
equilibrium state the scattering factor of such a unit cell can be written as [TO]: 


Nl _ Nh 

= X] (1) 

Li = l Hi = l 

where q is the vector of momentum transfer, Nl, /l and r//., Nh, fn represent the coordinates, total number 
and atomic scattering factors (ASF) of L- and H-atoms within the crystal unit cell correspondingly; Sl and Sh are 
the structure factors of the L- and H-subsystems which values depend on the positions of corresponding atoms within 
the crystal unit cell. They are defined as follows: 


Nl Nh 

= Y Sniq) = Y ( 2 ) 

Li = l Hi = l 

Here we assume that the energy of the XFEL pulse is far from the atomic shell ionization values and, as a result, 
the anomalous dispersion corrections in ASF can be omitted, so that it can be regarded as a real quantity. 

In the framework of the kinematical theory the diffraction peak intensity is proportional to the square of absolute 
value of the form factor of the unit cell m- 


lui^ = A\Fi^f = + |5H(g)|cosA^(g-)) , (3) 

where A is a constant which does not depend on the atomic parameters, Aip{^ = arg[S'i;,(^] — dLiglSni^] is the 
relative phase between the structure factors of both subsystems within the unit cell. 

The Patterson function P(r) of the system can be obtained by inverse Fourier transform of the measured intensities 
[iniiia. Following m this function represents a vector map of all possible combinations of atomic pairs peaking 
at points that correspond to all possible interatomic vectors. The height of these peaks is proportional to the product 
of ASF of atoms within such a pairwise combination. The total number of peaks is N‘^ {N is the total number of 
atoms within the crystal unit cell): N peaks overlap at the origin of the cell and correspond to combination of every 
atom with itself), N{N — 1) are distributed within the unit cell and correspond to all other possible combinations of 
distances among the atoms. The main idea of the Patterson deconvolution consists in finding possible positions of 
heavy atoms and assuming some initial (trial) values for the phase angle of each reflection with respect to their basis. 
However, this task is often a tough challenge because some of these N{N — 1) peaks can overlap, and the highest 
peaks do not always correspond to heavy-heavy atom pairs (the problem becomes much more difficult with increasing 
N). The width of the Patterson peaks is also affected by the overlapping and is influenced by thermal smearing. 
Both leads to lower resolution of peak positions and makes a sensible interpretation of these peaks rather difficult. 
In order to improve the resolution within the Patterson map we suggest a method based on some specific features of 
interaction of XFEL femtosecond pulses with matter. 

As it was shown earlier m, during the propagation of XEEL pulse through the crystal the ASE of the cell becomes 
dependent on specific parameters such as frequency uj, duration r and intensity J due to the time evolution of the 
electron density: 


F{^ F{q, {k}) ^ J„(g, {k}), (4) 

where k represents a set of parameters of the XEEL pulse. 

According to our calculations m it is possible to set specific experimental parameters so that the variation of ASE 
of H-atoms is relatively big whereas the ASE of L-atoms will change slightly: 


fL{q,{K}) 

hi^ 


RL{q,{K}) 


1 , 


/g(g, {^}) 


= Rniq, {«}) < 1 - 


(5) 


The structure factors Sl and Sh remain unchanged because the atomic positions in a crystal are fixed in the fs 
time range. Hence, it is possible to derive the following expression for the diffraction intensities: 
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^ |5'l(^)|^ + /Ir(^-Rlf(g, {k}) + 2/L(^/i/(^i?H(g, {/t}) |S'l(^| |5//(^| cos . (6) 

The values to be defined here are \Sh{^\ and cosAip{^. Supposing one can use three different regimes of 

the pulse parameters corresponding to different sets of {/i:} and performs the corresponding diffraction experiments 
with noticeable contrast in the intensity distribution, then it will be possible to extract these unknown values solving 
the system of three equations having the form of ^ (one should note that one of these regimes can be chosen in 
such a way that there is no evolution of ASF at all - for example, in a low-ffuence mode). As a result, different 
Patterson functions for the L-L, L-H and H-H atomic combinations can be constructed which will enhance the initial 
resolution and will help to define the possible positions and phases of H-atom. A similar method of using several 
forms of Patterson maps extracted from the analysis of parametric X-ray radiation has been considered in m. 

The key step for implementing this scheme and checking the possibility of Patterson map separation consists in 
estimation of the Rl and Rh values. Besides, one should choose such a set of pulse parameters so that on the one 
hand condition could be fulfilled and on the other hand a reasonable contrast in intensity distribution functions 
should be obtained (otherwise the system of equations of type ^ becomes degenerated). The static values of ASF of 
L- and H-atoms can be analytically derived on the basis of effective charge model nans], dynamical values of ASF 
can be estimated on the basis of CEIX (” crystal evolution induced by X-ray”) code described in m by simulation of 
evolution of electronic density of the crystal consisting of atoms of two different types and for various pulse parameters. 


III. THE ROLE OF FLUCTUATIONS 

The major obstacle in solving the system of master equations of type ^ is connected with possible fluctuations in 
atomic population probabilities of heavy atoms within the unit cell. Although we assume that evolution of every atom 
within the crystal unit cell is independent from each other, the variation of evolution paths could bring severe noise 
to the Patterson map making the interpretation rather difficult. Let us estimate the magnitude of such fluctuations 
and their influence on the shape of the diffraction pattern. 

The most relevant quantity for the formation of the diffraction peak is the average value of ASF {f{qR)) describing 
the number of scattering electrons as a function of time and the momentum transfer q = , where 0 is the scattering 

angle and A is the photon wave length. This value becomes time-dependent due to the fast evolution of the electron 
density of atoms within the crystal unit cell. The statistical character of the ionization processes means that the 
ASF at the moment of time t is a random value which depends on the probabilities of finding a certain electron 
configuration of the atom Px{t) [131 [I?]. Let us define the average ASF {f{qR)) and its standard deviation A/(g, t) : 


A 

A/(g, t) = \/{P{q,t)) - {fiq,t))^, (7) 

A 

where /a(^) is the stationary ASF value for the atomic configuration A at the momentum transfer q. Since the 
anomalous dispersion term is omitted we do not consider the energy range close to the resonance energy. 

The calculation of the ASF value with probabilities Px{t) related to one cell is performed by use of the ergodic 
hypothesis m for a statistical ensemble of atoms within the whole crystal. It is also supposed that the fluctuations 
of ASF for atoms in different cells are not correlated. In this case the ASF dispersion contributes only to the X-ray 
diffuse scattering background and does not change the intensity of the coherent diffraction peaks. 

In the framework of the kinematical theory, the diffraction peak intensity is defined by the square of the ASF of all 
atoms within the crystal m- In the case of diffraction of XFEL pulse, the ASE values are fluctuating so that they 
should be averaged for all configurations: 


/ i-oo 

{I{q,t)) J{t)dt, 

-OO 

im t)) = A / a ?, t)f:{q, \, 


a b 


(8) 
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where J(t) is the normalized envelope function of the XFEL pulse and the summation is performed over the coordinates 
Ra^Rb of the atoms with ASF faiqR) and fbiqR) in all unit cells of the crystal. 

As it was mentioned above the average ASF and its mean-squared fluctuations are supposed to be the same for 
equivalent atoms in all unit cells and fluctuations of ASF for different atoms are not correlated. In this case one can 
use the following equality: 


{faiq,t)fb{q,t)) 


{faiq,t)) {fbiq,t)) + [{faiq,t)fb{q,t)) - {fa{q,t)) {fb{q,t))] = 
= {fa{q,t)) {ft{q,t)) + [^fa{q,t)f] 5ab- 


Substituting into (|^ one can obtain: 


(9) 


{i{q,t)) = A 


'^{faiq,t))' 


,iqRa 


^{Afa{q,t)f > . 


( 10 ) 


Summation in (10) over the coordinates Ra of all cells in the crystal and over atoms within every unit cell is being 


performed by the following expansion for Ra'. 


Ra =n^fj, j = 1,2, ...,cr, 


(11) 


where n is the translation vector of the crystal lattice, and Vj and a are the coordinate of the j-th atom and the total 
number of atoms within the crystal unit cell correspondingly (translation symmetry of the crystal means that ASF 
depends on j index only). 

As a result, equation ([1Q|) transforms into the following form: 


{i{q,t)) = {iuiq,t)) 


E' 




{iu{q,t)) = A 




^iqrj 


( 12 ) 


where {Iu{qR)) is the total averaged diffraction intensity of the crystal unit cell. 

The first term in (12) defines the coherent diffraction intensity in accordance with the following identity 


t\t 2 _ / ^ \ ^ 

{icoh{q,t)) = -^'^\iuih,t))Hq-h). 


(13) 


where N, V and h are the total number of unit cells, the volume and the reciprocal lattice vectors of the crystal. 

The second term in (12) defines the background intensity of the diffuse (non-coherent) scattering. The relation of 
the total number of the diffuse photons scattered into the diffraction peak over the number of the coherent quanta 
scattered in the same angular interval is defined by the following parameter m- 


m 


A E,(AF,(g,i))2 
{Icoh{q,t)) 


L {icohiqR)) 


(14) 


where L and a are the linear dimensions of the crystal and the unit cell correspondingly (here one has taken into 


account that under diffraction conditions Xr, 


a). 


As it was shown in our paper m, the relative value of the standard deviation of fluctuation of ASF during the time 
of exposure of the crystal to the XFEL pulse does not exceed 20%, so that even in the case of nanocrystals ^ < 0.01 
is expected. As a result, one can conclude that the second term in equation (12) can be omitted remaining a high 


degree of accuracy so that the following approximate expression for the averaged intensity can be deduced: 
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Figure 1: The unit cell (a) and its Patterson map (b) of the model L-H crystal 







(15) 


Equation for \Sh(^\ and cos A(/:?(^ that is similar to §) can be obtained on the basis of ( [T^ b y carrying 

out additional integration ^ over time with pulse-weight function J(t) and assuming that condition is fulfilled 
(i.e., the ASF value of L-atoms is time-independent): 


m) 


{lum 




= A (/|(^ \SL{^f + (/Ir(^) + 2fL{^{fH{^) |-5l(^| |S'/f(^| cos , 


(16) 


where /l(^) is the static value of ASF of neutral (non-ionized) L-atoms, and (/^(^) are defined in the 

following way: 


^+00 


ifnigA)) 

J — 00 

/ + 00 

-00 

One should note that both values of 0 can be calculated by means of the CEIX code 
as ASF itself. 


(17) 


with the same accuracy 


IV. NUMERICAL RESULTS FOR ATOMIC POPULATIONS 

In order to prove the applicability of the scheme (l4|-(§ (updated with ) and check the stability of the 

numerical algorithm for the cases of small contrasts an arbitrary L-H crystal with pre-defined positions of atoms was 
chosen as example (it is important to note that we tried to avoid any type of symmetry in the unit cell in order to 
investigate the general case of the model). In order to implement certain values of cross-sections and rates of the 
processes defining the evolution of the system the potassium and fluorine atoms have been selected as prototypes of 
H- and L-atoms correspondingly. The ratio of total electron charges of the constituting atoms is Zx/Zp ~ 2.1 that 
should be regarded as a valid condition for implementing the heavy-light atomic separation. FigjT] shows the real 
charge distribution function and the Patterson map (that was obtained numerically as a direct Fourier-transform of 
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Figure 2: Atomic population probabilities as a function of time: (a) L-atoms, (b) H-atoms. 




Figure 3: Relative ionization value for L-atoms and H-atoms as a function of energy (a) and fluence (b). 


the charge distribution function) for the unit cell of model L-H crystal (the spacial linear scale was chosen in the way 
so that the cell size is equal to unity): total number of L-atoms is 10, total number of H-atoms is 5. 

Simulation of electron density evolution was performed by means of the CEIX code m with the following input 
conditions: the XFEL pulse was specified for a photon energy of 7 keV, Gaussian shape with 5 fs EWHM and a 
photon number of 10^^ per pulse. EigJ^ shows an example of such a simulation regarding the distribution of atomic 
population probabilities as function of time (the beam focus spot was chosen to be 1 jim?). As one can see, almost 
all L-atoms remain in the neutral state, whereas more than half of H-atoms are being ionized and the most populated 
state by the end of the pulse is the ion state with charge +2. 

In order to find different evolution regimes that could give non-degenerate contrast in the intensity distribution 
function the fluence of the pulse was subject to variation by changing the beam focus. Photoionization of the inner 
atomic shells is the main driving process that causes subsequent decay cascades and the main quantity responsible for 
its intensity is the photoionization cross-section value. Eor the selected prototypes of the constituting atoms at 7 kev 
the corresponding cross-sections for K over E differ by more than factor 20, so that condition (§ was expected to be 
satisfied with high degree of accuracy. In order to check that we performed additional simulation to study the relative 
ionization value (the relative deviation of number of bound electrons in the ion from the number of bound electrons 
in the neutral state) as a function of incident photon energy (see Fig|^ for details) at fluence of 10^^ photons//im^. 
As one can see, at 7 keV there is almost no ionization in L-atoms, whereas 1-2 electrons are excited in every H-atom 
in average. 

As it was mentioned above, the fluence was chosen to be the main parameter of [n] variation. EigJ^ shows the 
number of bound electrons per atom as function of fluence. One can conclude that within the considered range of 
fluences of 10^^ — 10^^ photons//im^ our assumption regarding the absence of a major evolution of electron density of 
L-atoms remains valid, whereas for heavy atoms one can choose different evolution regimes that can define reasonable 
contrasts in the intensity distribution functions. 
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Figure 4: Separate Patterson maps for the constituting atoms: (a) L-L atoms, (b) L-H atoms, (c) H-H atoms 


In order to simulate intensity distribution functions on the basis of the charge distribution function we have chosen 
probe fluences with values of 0.5 x 10^^ and 1.0 x 10^^ photons//im^ and the corresponding table of 1331 reflections 
was constructed. Based on the obtained intensity distribution functions for 2 time-dependent and 1 time-independent 
case (low-fluence case that corresponds to the Patterson map shown on the Fig{^) a linear equation solver was 
implemented in order to calculate the absolute values of structure factors and the cosine of the relative phase between 
scattering factors of L- and H- atoms. As a result, the separate Patterson maps for L-L, L-H and H-H atomic 
combinations were constructed (see FigQ and they coincided with the result of the direct calculation from the known 
charge distribution function that proved stability of the equation solver. One should note that this solution did not 
consider possible fluctuations caused by the variation of some beam parameters like flux and energy. 


V. DISCUSSION AND CONCLUSIONS 

The approach shown in the present paper enables one to create a separate Patterson map for the group of heavy 
atoms, the ASF of which is sensitive towards the radiation of XFEL fs-pulses. This additional information can 
be implemented in direct methods of finding the phases that are based on the phase retrieval procedure [20]. For 
instance, extracting the group of the strongest reflections is the starting point in the iteration techniques based on 
implementation of tangent formula m- In the framework of the present approach such a group is extracted directly 
from the Patterson map of heavy atoms making implementation of the retrieval procedure easier than in the case of 
analysis of the full Patterson map of a macromolecule. It is important to stress that in the case of centrosymmetric 
crystals the present approach even allows one to clearly define the relative phase of the structure factors of light 
and heavy atoms (i.e., both the absolute value and its sign). Retrieval of the heavy atom positions and knowing 
this relative phase enables one to define positions of light atoms as well and reconstruct the total charge distribution 
function. In the case of non-centrosymmetric crystals the present approach allows one to define the absolute values 
of the relative phase and structure factors of both groups of atoms. 
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